Closed captioning language translation

ABSTRACT

The present invention provides an architecture for translating closed captioning text originally provided with a video program from one language to another and presenting the translated closed captioning text with the video program to a viewer. As such, the viewers are able to receive the closed captioning text in languages other than that used for the closed captioning originally provided with the video program. The original closed captioning text may be translated from one language to another by a centralized closed captioning processor, such that the customer equipment for various subscribers can take advantage of centralized translation services. Once, the original closed captioning text is translated, the translated closed captioning text may be delivered to the customer equipment in different ways.

RELATED APPLICATIONS

The present application is a continuation of co-pending U.S. patent application Ser. No. 11/531,562, filed Sep. 13, 2006, entitled “CLOSED CAPTIONING LANGUAGE TRANSLATION.”

FIELD OF THE INVENTION

The present invention relates to closed captioning, and in particular to translating closed captioning text provided in a first language into a second language, wherein the translated closed captioning text is presented, to a viewer along with the corresponding video.

BACKGROUND OF THE INVENTION

Closed captioning allows deaf, hard of hearing, and hearing impaired people to read a transcript of an audio portion of a television, video, or film presentation. As the video is presented to the viewer, text captions are displayed identifying who is speaking, transcribing what is being said, and indicating relevant sounds, such as laughing, crying, crashes, explosions, and the like. Closed captioning is also used to assist people who are learning an additional language, learning to read, or for those in a noisy environment.

For the present disclosure, television, video, and film presentations are referred to as “video,” and the text captions representing the closed captioning of the video are referred to as “closed captioning text.” Closed captioning text is encoded into the video using any number of closed captioning techniques. In many instances, different types of video programming employ different types of closed captioning encoding and decoding.

In North America, National Television Systems Committee (NTSC)-based programming encodes closed captioning text into line 21 of the vertical blanking interval. The vertical blanking interval is a portion of the analog television picture that resides just above the visible portion of the video, and is not seen by the viewer. The viewer's set-top box or television is able to decode the encoded closed captioning text provided in line 21 of the vertical blanking interval and present it to the viewer in association with the video. For digital television, the Advanced Television Systems Committee (ATSC)-based programming encodes three data streams into the video to support closed captioning. One of the streams can support up to 63 unique closed captioning streams, which are encoded in an EIAA-708 format as set forth by the Electronic Industries Alliance (EIA). The other two streams are encoded such that when the digital video is converted to analog video, the closed captioning text appears as encoded closed captioning in line 21 of the vertical blanking interval. Outside of North America, Phase Alternation Line (PAL) and Sequential Color With Memory (SCCAM) video standards transmit and store closed captioning information in a different manner, but the overall result is the same.

Regardless of the encoding and delivery technique, closed captioning is extremely beneficial in providing a transcript of an audio portion of a video program. Unfortunately, closed captioning text is generally only available in one language, although most closed captioning standards support different closed captioning streams for different languages. However, the significant effort and expense associated with providing closed captioning generally limits the closed captioning text to the most prevalent language in which the video will be presented. In the United States, for example, closed captioning is generally only provided in English, even though there are significant Hispanic, Asian, and European contingents who would benefit from closed captioning in their native languages.

Accordingly, there is a need for a way to efficiently and effectively translate closed captioning text presented in a first language into a second language, and make the translated closed captioning text available to viewers of the associated video.

SUMMARY OF THE INVENTION

The present invention provides an architecture for translating closed captioning text originally provided with a video program from one language to another and presenting the translated closed captioning text with the video program to a viewer. As such, the viewers are able to receive the closed captioning text in a language other than that used for the closed captioning originally provided with the video program. The original closed captioning text may be translated from one language to another by a centralized closed captioning processor, such that the customer equipment for various subscribers can take advantage of centralized translation services. Once the original closed captioning text is translated, the translated closed captioning text may be delivered to the customer equipment in different ways.

In a first embodiment, the video program is sent to the closed captioning processor and the customer equipment at the same time. The closed captioning processor will translate the original closed captioning text from one language to another. After translation, the closed captioning processor will deliver the translated closed captioning text to the customer equipment, which will present the translated closed captioning text with the video program.

In another embodiment, the video program is initially sent to the closed captioning processor. The closed captioning processor will translate the original closed captioning text from one language to another. After translation, the closed captioning processor will encode the translated closed captioning text into the video program as closed captioning. The closed captioning processor will then deliver the video program with the new closed captioning content to the customer equipment, which will recover the translated closed captioning text using traditional closed captioning decoding techniques.

Those skilled in the art will appreciate the scope of the present invention and realize additional aspects thereof after reading the following detailed description of the preferred embodiments in association with the accompanying drawing figures.

BRIEF DESCRIPTION OF THE DRAWING FIGURES

The accompanying drawing figures incorporated in and forming a part of this specification illustrate several aspects of the invention, and together with the description serve to explain the principles of the invention.

FIG. 1 is a block representation of a closed captioning architecture according to a first embodiment of the present invention.

FIG. 2 is a block representation of a closed captioning translation process according to the embodiment of FIG. 1.

FIG. 3 is a flow diagram illustrating the closed captioning translation process according to the embodiment of FIG. 1.

FIG. 4 is a block representation of a closed captioning architecture according to a second embodiment of the present invention.

FIG. 5 is a block representation of, a closed captioning translation process according to the embodiment of FIG. 4.

FIG. 6 is a flow diagram illustrating the closed captioning translation process according to the embodiment of FIG. 4.

FIG. 7 is a block representation of a closed captioning processor according to one embodiment of the present invention.

FIG. 8 is a block representation of customer equipment according to one embodiment of the present invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The embodiments set forth below represent the necessary information to enable those skilled in the art to practice the invention and illustrate the best mode of practicing the invention. Upon reading the following description in light of the accompanying drawing figures, those skilled in the art will understand the concepts of the invention and will recognize applications of these concepts not particularly addressed herein. It should be understood that these concepts and applications fall within the scope of the disclosure and the accompanying claims.

The present invention provides an architecture for translating closed captioning (CC) text originally provided with a video program from one language to another and presenting the translated closed captioning text with the video program to a viewer. As such, the viewers are able to receive the closed captioning text in a language other than that used for the closed captioning originally provided with the video program. The original closed captioning text may be translated from one language to another by a centralized closed captioning processor, such that the customer equipment for various subscribers can take advantage of centralized translation services. Once the original closed captioning text is translated, the translated closed captioning text may be delivered to the customer equipment in different ways. In a first embodiment, the video program is sent to the closed captioning processor and the customer equipment at the same time. The closed captioning processor will translate the original closed captioning text from one language to another. After translation, the closed captioning processor will deliver the translated closed captioning, text to the customer equipment, which will present the translated closed captioning text with the video program.

In another embodiment, the video program is initially sent to the closed captioning processor. The closed captioning processor will translate the original closed captioning text from one language to another. After translation, the closed captioning processor will encode the translated closed captioning text into the video program as closed captioning. The closed captioning processor will then deliver the video program with the new closed captioning content to the customer equipment, which will recover the translated closed captioning text using traditional closed captioning decoding techniques.

In either embodiment, any number of translations may be provided for original closed captioning text. The translation process and the desired translation or translations may be requested by the customer equipment, the service provider, or another subscriber device. The video program may be provided in an analog or digital format via broadcast or recorded medium.

With reference to FIG. 1, a closed captioning translation architecture is illustrated according to a first embodiment of the present invention. The closed captioning translation architecture 10 includes a video source 12 capable of delivering video programs to various customer equipment 14, which may be associated with different subscribers at different locations. The customer equipment 14 may take various forms, including a set-top box (STB) 14A, an associated television or monitor 14B, a personal computer (PC) 14C, as well as any other mobile device, such as a personal digital assistant (PDA) 14D or mobile telephone (not shown). The video programs that require closed captioning translation are also sent to a closed captioning (CC) processor 16 at the same time they are being sent to the customer equipment 14.

In FIG. 1, video programs are delivered to the customer equipment 14 and the CC processor 16 via an appropriate video signal 18, which provides closed captioning in a first language. The CC processor 16 will translate the closed captioning text to a second language and will provide corresponding translated CC information 20 to the appropriate customer equipment 14, as indicated above. To reduce the computational requirements of the customer equipment 14, and to minimize delays associated with translating closed captioning text from one language to another, the CC processor 16 is provided in a central network 22. As such, the CC processor 22 is separate from the customer equipment 14. Preferably, the CC processor 16 is able to provide translation services to customer equipment 14 at numerous customer premises 24.

FIG. 2 illustrates a simplified block representation of the closed captioning translation architecture of the embodiment illustrated in FIG. 1. As illustrated, a video source 12 will simultaneously provide the video signal 18 for the video program to the CC processor 16 and the customer equipment 14. The video signal 18 will have a video, audio, and closed captioning component having closed captioning text (CC1), regardless of the signal type or closed captioning encoding technique.

With reference to FIG. 3, operation of the CC processor 16 is illustrated. The CC processor 16 may receive a CC translation request from the customer equipment 14 to identify a language to which the closed captioning text (CC1) provided in the video signal 18 should be translated (step 100). The CC processor 16 will then receive the video signal 18 with the original closed captioning for the video program (step 102), and extract the original closed captioning text (CC1) from the video signal 18 using the appropriate closed captioning decoding techniques (step 104). Once the original dosed captioning text (CC1) is extracted, the CC processor 16 will translate the original closed captioning text from one language to another to create translated closed captioning text (CC2) (step 106). The translated closed captioning text is delivered to the customer equipment 14 in translated closed captioning information 20, which is based on the translated, closed captioning text (step 108), as also illustrated in FIG. 2.

Translation of the closed captioning text may be provided using different techniques. For example, the translation may be provided on a word-by-word or phrase-by-phrase basis. Alternatively, the closed captioning text over a certain period of time may be accumulated and then translated as an entire segment. The particular type of translation is beyond the scope of the present invention, and those skilled in the art will recognize various translation techniques that will be beneficial in various video delivery environments.

In one embodiment, the translated closed captioning information is not re-encoded into a closed captioning format, and is sent to the customer equipment 14 without the video or audio components of the video program. The translated closed captioning information 20 is formatted such that the customer equipment 14 can readily recover the translated closed captioning text (CC2) and overlay the text on the video program being presented to the viewer. The overlay procedure may be configured to emulate traditional closed captioning, or may be presented in any desired fashion. Notably, the customer equipment 14 need not have closed captioning decoding capabilities, since the translated closed captioning information is not necessarily provided in a closed captioning format, although it represents a translation of the original closed captioning text (CC1).

Since the translated closed captioning text (CC2) is being delivered to the customer equipment 14 in the translated closed captioning information 20 separately from the video signal 18, steps must be taken to synchronize the presentation of the translated closed captioning text (CC2) with the video of the video program. In the translated closed captioning information 20, the CC processor 16 may provide markers or like synchronization information corresponding to the video in the video program, such that the translated closed captioning text is presented at the appropriate time and rate in association with the video of the video program. Alternatively, the customer equipment 14 and the CC processor 16 may periodically or continuously communicate with each other to ensure that the translated closed captioning text (CC2) is presented to the viewer along with the video of the video program in a synchronized fashion.

The CC processor 16 will inevitably inject some delay in presenting the translated closed captioning text (CC2) to the customer equipment 14. The customer equipment 14 may employ at least a video buffer to buffer the video of the video program for a time sufficient to receive the translated closed captioning text (CC2) from the CC processor 16. The synchronization information in the closed captioning text will control the amount of buffering. The customer equipment 14 will then present the translated closed captioning text (CC2) and the video of the video program in a synchronized fashion to the viewer.

Given the centralized nature of the CC processor 16, any customer equipment 14 may receive translation services. Further, the CC processor 16 may be configured to translate between any number of languages, such that closed captioning text may be translated into any number of languages and may be presented to any amount of customer equipment 14 in an effective and efficient manner. Preferably, the customer equipment 14 is able to request a particular type of translation for a particular video program or for all programming in light of the viewer's needs or desires.

With reference to FIG. 4, an alternative closed captioning translation architecture is provided. As illustrated, the video signal 18 is sent to the CC processor 16 from the video source 12. The video signal 18 is not directly delivered to the customer equipment 14 when closed captioning translation services are employed. Instead, the CC processor 16 will extract the closed captioning text from the video signal 18, translate the closed captioning text from one language to another, and encode translated closed captioning text into the video signal 18 as traditional closed captioning. The CC processor 16 will then deliver a video signal 26 having closed captioning with the translated closed captioning text to the customer equipment 14.

This process is further illustrated in FIG. 5. Notably, the CC processor 16 presents video, audio, and closed captioning aspects of the video program in the video signal 26 to the customer equipment 14. Notably, the translated closed captioning text (CC2) may replace the closed captioning text (CC1) that was provided in the original video signal 18, or may be added in a separate closed captioning field, as illustrated. Upon receipt of the video signal 26, the customer equipment 14 may use traditional closed captioning decoding techniques to present the translated closed captioning text CC2 or original closed captioning text CC1 to the viewer. Unlike the previous embodiment where the translated closed captioning text (CC2) required a different process to overlay the translated closed captioning text on the video of the video program, this embodiment embeds translated closed captioning text into the video signal 26 prior to the customer equipment 14 receiving the video signal 26. This embodiment is particularly beneficial in environments where the customer equipment 14 is not configured to receive translated closed captioning information via a separate source and overlay the corresponding translated closed captioning text (CC2) over the video of the video signal 18. However, this embodiment requires that the customer equipment 14 have closed captioning decoding capabilities.

With reference to FIG. 6, operation of the CC processor 16 is provided according to the second embodiment. Initially, the CC processor 16 may receive closed captioning translation instructions from the customer equipment 14, a service provider, or a subscriber via another piece of customer equipment 14 (step 200). The instructions may identify a desired language for closed captioning, as well as the video program or programs for which closed captioning translation services are desired. The CC processor 16 may then receive a video signal with closed captioning for a video program (step 202). Again, the closed captioning will include closed captioning text (CC1) in a first language. The CC processor 16 will extract the closed captioning text (CC1) (step 204) and translate the closed captioning text (CC1) into translated closed captioning text (CC2) (step 206). The translated closed captioning text is then encoded into the video signal 18 of the video program using appropriate closed captioning encoding techniques (step 208). The CC processor 16 will deliver the resultant video signal 26 to the customer equipment 14 (step 210). Again, the video signal 26 will have closed captioning including the translated closed captioning text (CC2), and perhaps the original closed captioning text (CC1).

In this embodiment, synchronization of the translated closed captioning text (CC2) with the video of the video program is provided in the CC processor 16 during the closed captioning encoding process. The customer equipment 14 will simply decode the appropriate closed captioning stream, which includes the translated closed captioning text (CC2), and present the translated closed captioning text (CC2) to the viewer along with the video in traditional fashion.

Those skilled in the art will recognize that different closed captioning encoding and decoding techniques are available and known in the art. In light of the different closed captioning processes and the various manners in which video programs may be recorded, broadcast, or delivered to customer equipment 14, the concepts of the present invention may take corresponding forms, as will be appreciated by those skilled in the art.

With reference to FIG. 7, a block representation of a CC processor 16 is illustrated. The CC processor 16 may include a control system 30 associated with sufficient memory 32 for the requisite software 34 and data 36 to operate as described above. The control system 30 may also be associated with one or more communication interfaces 38 to facilitate communications directly or indirectly with the video source 12 and the customer equipment 14.

A block representation of customer equipment 14 is provided in FIG. 8. The customer equipment 14 may include a control system 40 having sufficient memory 42 for the requisite software 44 and data 46 to operate as described above. The control system 40 may also be associated with one or more communication interfaces 48 to facilitate communications with other communication equipment 14, the CC processor 16, and the video source 12, in a direct or indirect fashion. The control system 40 may also be associated with a user interface 50, which may represent a display, monitor, keypad, mouse, remote control input, or the like.

Those skilled in the art will recognize improvements and modifications to the preferred embodiments of the present invention. All such improvements and modifications are considered within the scope of the concepts disclosed herein and the claims that follow. 

1. A video display system, comprising: a user interface operable to receive a user input requesting information presentation in a user-specified format; at least one communication interface operable to: in response to the user input, transmit a request for information presentation in the user-specified format over a communications network to a remote source of requested information; and receive a video signal and at least one requested information signal over a communications network from at least one remote signal source, the at least one requested information signal comprising information in the user-specified format; a video display; and a control system operable to present the video signal and the at least one requested information signal in the user-specified format on the video display.
 2. The video display system of claim 1, wherein the requested information is textual information.
 3. The video display system of claim 2, wherein the textual information is closed captioning.
 4. The video display system of claim 1, wherein the user-specified format is a user-specified language.
 5. The video display system of claim 1, wherein the at least one communication interface comprises a video interface operable to receive the video signal from a remote video signal source.
 6. The video display system of claim 5, wherein the video interface is operable to receive the video signal and the at least one requested information signal from the remote video signal source.
 7. The video display system of claim 6, wherein the at least one requested information signal is incorporated in the video signal.
 8. The video display system of claim 6, wherein the remote video signal source is the remote source of the requested information.
 9. The video display system of claim 6, wherein the at least one communication interface further comprises at least one requested information signal interface operable to receive the at least one requested information signal from the remote source of the requested information, the at least one requested information signal comprising the requested information in the user-specified format.
 10. The video display system of claim 1, wherein the control system is operable to synchronize the requested information in the user-specified format with video information from the video signal.
 11. The video display system of claim 10, wherein the control system is operable to use synchronization information in the at least one requested information signal to synchronize the requested information in the user-specified format with the video information.
 12. The video display system of claim 10, wherein the control system is operable to communicate with the remote source of the requested information over the at least one requested information signal interface to synchronize the requested information in the user-specified format with the video information.
 13. The video display system of claim 10, further comprising a video buffer controlled by the control system to synchronize the requested information with the video information.
 14. A method of operating a video display system, comprising: receiving a user input requesting information presentation in a user-specified format in response to, the user input, transmitting a request for information presentation in the user-specified format over a communications network to a remote source of requested information; and receiving a video signal and at least one requested information signal over a communications network from at least one remote signal source, the at least one requested information signal comprising information in the user-specified format; presenting the video signal and the requested information in the user-specified format on a video display.
 15. The method of claim 14, wherein the requested information is textual information.
 16. The method of claim 15, wherein the textual information is closed captioning.
 17. The method of claim 14, wherein the user-specified format is a user-specified language.
 18. The method of claim 14, further comprising synchronizing the requested information in the user-specified format with video information from the video signal.
 19. The method of claim 14, comprising using synchronization information in the at least one requested information signal to synchronize the requested information in the user-specified format with the video information.
 20. The method of claim 14, comprising communicating with the remote source of the requested information to synchronize the requested information in the user-specified format with the video information.
 21. An apparatus for providing requested information in a user-requested format system, comprising: at least one communication interface operable to receive a request for information presentation in a user-specified format over a communications network from a remote user video display system; and a control system operable to provide a requested information signal comprising requested information in the user-specified format; the at least one communication interface being further operable to transmit the requested information signal to the remote user video display system.
 22. The apparatus of claim 21, wherein the control system is operable to provide the requested information signal by generating the requested information signal in response to the received request.
 23. The apparatus of claim 21, wherein the requested information is textual information.
 24. The apparatus of claim 21, wherein the user-specified format is a user-specified language.
 25. The apparatus of claim 24, wherein the control system is operable to translate an information signal into the user-specified language in response to the request.
 26. The apparatus of claim 21, wherein the at least one communication interface is further operable to receive at least one video signal and to transmit at least one video signal to the remote user video display system.
 27. The apparatus of claim 26, wherein the control system is operable to incorporate the requested information signal in an associated video signal, and the communication interface is operable to transmit the video signal with the incorporated requested information signal to the remote user video display system.
 28. The apparatus of claim 26, wherein the control system is operable to provide the requested information signal by generating the requested information signal from a selected received video signal in response to the received request.
 29. The apparatus of claim 28, wherein the user-specified format is a user-specified language, and the control system is operable to translate an information signal into the user-specified language in response to the request.
 30. The apparatus of claim 29, wherein the requested information is textual information.
 31. A method of providing requested information in a user-requested format system from a central information source, the method comprising: receiving a request for information presentation in a user-specified format over a communications network from a remote user video display system; generating a requested information signal comprising the requested information in the user-specified format; and transmitting the requested information signal to the remote user video display system.
 32. The method of claim 31, wherein generating the requested information signal comprises generating the requested information signal in response to the request.
 33. The method of claim 31, wherein the requested information is textual information.
 34. The apparatus of claim 31, wherein the user-specified format is a user-specified language.
 35. The method of claim 34, wherein generating the requested information signal comprises translating an information signal into the user-specified language in response to the received request.
 36. The method of claim 31, wherein further comprising receiving at least one video signal and transmitting at least one video signal to the remote user video display system.
 37. The method of claim 36, comprising incorporating the requested information signal in an associated video signal, and transmitting the associated video signal with the incorporated requested information signal to the remote user video display system.
 38. The method of claim 36, wherein generating the requested information signal comprises generating the requested information signal from a selected received video signal in response to the request.
 39. The method of claim 38, wherein the user-specified format is a user-specified language, and generating the requested information signal comprises translating an information signal into the user-specified language in response to the request.
 40. The method of claim 39, wherein the requested information is textual information. 